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Abstract 

The article is devoted to the nonparametric estimation of the quadratic covariation of non-synchronously 
observed Ito processes in an additive microstructure noise model. In a high-frequency setting, we aim at 
establishing an asymptotic distribution theory for a generalized multiscale estimator including a feasible 
central limit theorem with optimal convergence rate on convenient regularity assumptions. The inevitably 
remaining impact of asynchronous deterministic sampling schemes and noise corruption on the asymptotic 
distribution is precisely elucidated. A case study for various important examples, several generalizations 
of the model and an algorithm for the implementation warrant the utility of the estimation method in 
applications. 
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1. Introduction 

The nonparametric estimation of the univariate quadratic variation of a latent semimartingale from n 
observations in a high-frequency setting with additive observation noise has been studied intensively in 
recent years. It is known from |,13] that n^^ constitutes a lower bound for the rate of convergence. An 
important motivation which has stimulated an alliance of economists and statisticians to establish estima- 
tion techniques for this kind of latent semimartingale models is their utility in estimating daily integrated 
(co-)volatilities from high-frequency intraday returns that serve as a basis for risk management as well 
as portfolio optimization and hedging strategies. The last years have seen an enormous increase of the 
amount of trading activities for many liquid securities. Paradoxically, the availability of high-frequency 
data necessitated a new angle on financial modeling. In fact, for every semimartingale the discrete realized 
(co-)volatilities converge in probability to the integrated measures. However, realized volatilities of typical 
high-frequency financial time series data explode for very high frequencies. This effect is ascribed to mar- 
ket microstructure frictions. Sources of the market microstructure noise are manifold. One important role 
plays the occurrence of bid-ask spreads. Aside from that transaction costs, strategic trading, limited market 
depths and discreteness of prices spread out the structure of the long-run dynamics that can be character- 
ized by semimartingales. 

This strand of literature followed ||3T1 that has attracted a lot of attention to this estimation problem. The 
so-called two-scales realized volatility by OTI is based on subsampling and a bias-correction and a sta- 
ble central limit theorem with n^/^-rate has been proved. A refinement of the subsample approach using 
multiple scales in ll30l and related alternative techniques in ID, ||231 and ll29l have led to rate-optimal 
estimators and feasible stable central limit theorems. For the more specific nonparametric model with 
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Gaussian noise, asymptotic equivalence in the Le Cam sense to a Gaussian shift experiment is shown in 
|j26l and an asymptotically efficient estimator whose asymptotic variance equals the parametric efficiency 
bound is constructed. 

In the article on hand we are concerned with a multivariate stetting and apart from taking additive mi- 
crostructure noise into account, we we focus on a way to deal with non-synchronous observation schemes. 
This is also a central theme in financial applications. When realized covolatilities are calculated for fixed 
time distances and a previous-tick interpolation is applied, the phenomenon of the so-called Epps effect 
described in flO] appears that the realized covolatility tends to zero at the highest frequencies. 
A methodology to deal with non-synchronous observations in a bivariate Ito processes model has been 
proposed by ifTSl . The so-called Hayashi-Yoshida estimator has superseded simpler previous-tick inter- 
polation methods setting the standard for the estimation of the quadratic covariation from asynchronous 
observations in the absence of microstructure noise effects. 

Our estimation approach, first proposed in IS], for the most general case in the presence of noise and non- 
synchronicity arises as a combination of the multiscale estimator to handle noise contamination on the one 
hand and a synchronization algorithm in accordance with the Hayashi-Yoshida estimator to cope with non- 
synchronicity on the other hand. A first attempt in the same direction, combining one-scale subsampling 
and the Hayashi-Yoshida estimator, has been given in f22\. 

In ID it has been shown in the spirit of [ 13 1 that the optimal convergence rate n^^'^ carries over to the general 
multidimensional setup. The mathematical analysis of our generalized multiscale estimator in |[8l shows 
that it is rate-optimal. 

Alternative approaches to similar statistical models has been suggested by lH], |l9l and ||T|. In IS) a kernel- 
based method with a previous-tick interpolation to so-called refresh times is proposed and a stable central 
limit theorem with sub-optimal n^/'^-rate is established for a multivariate non-synchronous design. This 
estimator, furthermore, ensures that the estimated covariance matrix is positive semi -definite. |9| and IT] 
come up with combinations of pre-averaging (|23 |,|19|) and the Hayashi-Yoshida estimator and of the 
univariate quasi-maximum-likelihood method by |29|, the polarization identity and a generalized synchro- 
nization scheme which is different from the Hayashi-Yoshida ansatz that we use, respectively, both also 
attaining the optimal rate. 

In this article we aim at providing an asymptotic distribution theory for the generalized multiscale estima- 
tor In distinction from alternative methods, the influence of non-synchronicity effects on the expectation 
is null and on the variance limited up to an interaction of interpolation steps and microstructure noise. The 
main result is a feasible stable central limit theorem for its estimation error with optimal rate and a closed- 
form asymptotic variance that does not hinge on interpolation errors in the signal term. The stable weak 
convergence of the estimation error to a centred mixed Gaussian limit and the consistent estimation of the 
random unknown asymptotic variances are the essential steps towards statistical inference and confidence 
sets. The theory is grounded on stable limit theorems for semimartingales from flSl . 
In Section [2] we present the model and our main findings. Section [3] comes up with a concise overview 
on the construction of the estimator and in Section |4] we develop the asymptotic theory. In Section |5] we 
propose a consistent estimator for the asymptotic variance and Section[6]comprises various extensions and 
and a concluding discussion. The proofs are postponed to the Appendix. 

2. Model and key result 

The considered statistical model of noisy latently observed Ito processes at deterministic observation 
times is precisely described by Assumptions [T]|3]in this section. 

Assumption 1 (efficient processes). On a filtered probability space (fl, T ^ {-Ft) , ]P)> the efficient processes 
X = {Xt)teB.+ ond Y = {Yt)t£^+ are Ito processes defined by the following stochastic differential 
equations: 

dXt = ^J.^ dt + dBf , 
dYt ^ dt + aj dSY , 

with two (J-t)— adapted standard Brownian motions and B^ and ptdt — d [i?^, B^^^- The drift 
processes and fiY '^''^ {J^t)-adapted locally bounded stochastic processes and the spot volatilities 
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(J I and a I and pt are assumed to be [Ftj-ddapted with continuous paths. We assume strictly positive 



volatilities and the Novikov condition E 



exp 



< oo for X and Y. 



Assumption 2 (observations). The deterministic observation schemes 7"'^'" = {0 < ig"' < t J"-* < . . . < 

f^""-* < T} of X and 7"^'™ = {0 < Tq"^^ < r}™' < . . . < t,!™'' < T} of Y are assumed to be regular in 
the following sense: There exists a constant < a < 1/9 such that 

S^- snp^^{{t^r^-tt\),t^^\T-t^-^) =0(n-«M, (la) 



J^=^^^sup^^((rf)-rjr^),rr\T-ri™)) =0(m-V^^^^ . (lb) 

We consider asymptotics where the number of observations of X and Y are assumed to be of the same 
asymptotic order n — 0{m) and m = 0{n) and express that shortly by n ^ m. The efficient processes 
X and Y which satisfy Assumption^are discretely observed at the times f-^'^ and 7"^'™ with additive 
observation noise: 

X(„) = / pf dt+ fjf dB^ + e^„) , < i < n , (2a) 
Jo Jo *' 



F (,„) = / K dt+ al dBf + e^(„) , < j < m . (2b) 

"^j Jo Jo "^J 

Although we consider sequences of deterministic observation times, the case of random sampling that 
is independent of the observed processes is included when regarding the conditional law. 
It turns out that it is accurate to prove the key result of the article on the following i. i. d. assumption on 
the microstructure noise since a closed-form expression for the asymptotic variance is not available for 
a combination of general asynchronous observation schemes and serially dependent observation errors. 
Since an extension to non-i. i. d. noise is crucial for the utility in financial applications, we comment on the 
robustness of our estimator to that case in Section |6] 

Assumption 3 (microstructure noise). The discrete microstructure noise processes 

e;^;),e^(„),0<z<7i,0<j <m. 

are centred i. i. d. , independent of each other and independent of the efficient processes X and Y. We 
assume that the observation errors have finite fourth moments and denote the variances 



r;i=Var(e|„,) , jyf- = Var (e^<„,) 



The number of synchronized observations N ^ n ^ m which appears in the rate of our feasible stable 
central limit theorem is introduced in Section[3] 

Tlieorem 1 (feasible stable central limit theorem). The generalized multiscale estimator dl2] i specified 
by the later given weights ( |A.l| l, with Mn = Cmuiu • converges on the Assumptions\l\\2\ p\and further 
mild regularity conditions on the asymptotics of the sampling schemes, stated below in Assumptions^and 
1^ J-— stably in law with optimal rate N^^* ^ nl'^ ~ m^'^ to a mixed Gaussian limiting distribution: 

/ multi \ f 

TVV* [X, Y]^ - [X, F]^ 4 N (0, AVAR,„„,,,) 



wi 



th a almost surely finite random asymptotic variance given in ( |18|l in Theorem |5] With the consistent 



estimator for the asymptotic variances AVARmuiu in Proposition 5.1 , the feasible central limit theorem 



— - — — . multi 

[X,Y]^ -[X,Y], 



n''- ^ N(0, 1) , (3) 

AVAR„ 



Iwlds true. 
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The notion of stable weak convergence going back to [27] is essential for our asymptotic theory. Stable 

st 

weak convergence X„ X is the joint weak convergence of (X„ . Z) to {X, Z) for every measurable 
bounded random variable Z. The limiting random variables in stable limit theorems are defined on exten- 
sions of the original underlying probability spaces. The reason for us to involve this concept of a stronger 
mode of weak convergence is that mixed normal limiting distributions are derived where asymptotic vari- 
ances are themselves strictly positive random variables. Provided we have a consistent estimator for 

such a random asymptotic variance on hand, the stable central limit theorem Xn ^ VZ with Z dis- 
tributed according to a standard Gaussian law, yields the joint weak convergence {Xn, V^) {VZ, V^) 
and also Xn/Vn Z and hence allows to perform statistical inference providing tests or confidence inter- 
vals. 

In the proofs of our limit theorems we will 'remove' the drifts in the sense that after a transformation to an 
equivalent martingale measure stable central limit theorems for Ito processes without drift are proved and, 
as illustrated in [21], stability of the weak convergence ensures that the asymptotic law holds true under 
the original measure. In this sense stable convergence is commutative with measure change. 
From now on, we often omit the superscripts of observation times for a shorter notation. 



3. Brief review on the foundation 

3.1. Subsampling and the multiscale estimator 

In the model imposed by Assumption^] Assumption |2] with synchronous observations, n = m and 
tj"'' = 1 < J < and Assumption pTthe realized (co-)volatilities do not provide consistent estima- 
tors for the quadratic (co-)variations any more. The variance due to noise conditional on the paths of the 
efficient processes 

increases linearly with n. The error due to noise perturbation can be reduced by the following estimator, 
which has been proposed for the univariate estimation of integrated volatility as the "second best approach" 
in II3TI and which is called one-scale subsampling estimator in |8| and throughout this article. It can 
be motivated from two perspectives that are both sketched in Figure [T] On the left-hand side we have 
visualized that one can calculate simultaneously lower frequent realized covolatilities using subsamples, 
e. g. to the lag three in Figure[T[ and (post-)average them. 

b 1 " 
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This motivation given in OTII is in line with the former common practice of a sparse-sampled low-frequency 
realized (co-)volatility estimator and proposes to use an average instead of one single lower frequent real- 
ized measure. 

The same estimator arises as the usual realized covolatiUty calculated from the time series on that a linear 
filter is run before, what means that non-noisy observations at a time tj are estimated with a (pre-)average 
of noisy observations at times tj, . . . , tj^i for some i. This is sketched on the right-hand side of Figure [T] 
for i ~ 3. Passing over to increments leads to telescoping sums and we end up finally with the one-scale 
subsampling estimator. 

Since on the Assumption |3] there is no bias due to noise for the bivariate estimator, it already corresponds 
to the "first best approach" from OTl whereas in the univariate case a bias-correction completes the two 
scales realized volatility (TSRV): 

j=i i=l 

There is a trade-off between the signal term and the error due to noise. Choosing i = Csuhnl^ dependent 
on n with a constant Csub^ the overall mean square error is minimized and of order n~^l^ . The one- 
scale subsampling estimator ( |4a] l is hence a consistent and asymptotically unbiased estimator. The rate of 
convergence n^/", however, is slow and does not attain the optimal rate nl^ determined in lO. For this 
reason, we focus on a multiscale extension of the subsampling approach on which the methods developed 
in [8] are based on. The multiscale realized covolatility (MSRC), and the univariate multiscale realized 
volatility (MSRV) introduced in 1301 . are Unear combinations of one-scale subsampling estimators with 
Mn different subsampling frequencies i = 1, . . . , Af„: 

,,. -fl/jv opt n 
multi — ^ OL- , f _ — , / _ _ \/~ ~ \ 

[X, ^Y.^Y. (^*. - (^*. - ' (5a) 

,,. Mt^ opt n „ 
^ multi ^ — ^ a,- J r ^ — , / - - \ ^ 

=E^E(^*.-^*.-.) ■ (5b) 

1=1 i=i 

The weights are chosen such that the estimator is asymptotically unbiased and the error due to noise min- 
imized. They are given later in and can be chosen equally for the bivariate and the univariate case. 
Those are the standard discrete weights of ll30l and we abstain from giving a more general class of possible 
weight functions. 

The mean square error of the multiscale realized covolatility (|5a]i can be split in uncorrected addends that 
stem from discretization, microstructure noise and cross terms and end-effects. They are of orders A/„/n, 
n/M'^, and A/^^, respectively. Hence, a choice Af„ = Cmuiti\fn leads to a rate-optimal ?i ^/''-consistent 
estimator. 

The following stable central limit theorems for the multiscale realized covolatility (|5a| and the one-scale 
estimator ( |4a| i are implied by Theorem[3]and Corollary 4.1 



Proposition 3.1. On Assumption^A ijaMc/jij/n the synchronous setup and if {n/T) ^^{t^^^^ — ^^"''1)^ con- 
verges to a continuously differentiabie limiting function G and tlie difference quotients converge uniformly 
to G' on [0, T], the multiscale realized covolatility ( |5a| l and the subsampling estimator ( |4a| l converge stably 
in law to mixed normal limiting random variables: 



1/4 



multi I ^ 

[X, Y]j, - [X, r] J, U4 N (0 , AVAR^uiu.syn) , (6a) 



-sub I f 

[X, Y]j, - [X, r] J, p4 N (0 , AVAR,„b,,y„) , (6b) 
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with 

26 

AVARmuiu,syn = c;^liu^4.rjx Vy + CmuiUT^T / G'(t)(p2 + l){a^aYf dt (6c) 

Jo 

+ c'^\iu^{^Wy+vI {crjfdt + ifyj^ {a^ fdt^ , 

AVARsub,syn = cJ^b'^VxVY + Csub^T f G' {t){pl + l){a^<jYf dt . (6d) 

-J Jo 

3.2. Synchronization and the Hayashi-Yoshida estimator 

We use the short notation AXt. ,i = 1, . . . ,n from now on for increments Xt. — Xt._-^ and analogously 
for Y. The Hayashi-Yoshida estimator 

[X,Y]j, ^ y y ^ AXt^Ayr^l[min(t,,r,.)>inax(t._i,T,_i)] , (7) 

i=l j = l 

where the product terms include all increments of the processes with overlapping observation time in- 
stants, has been proved in 1 15| to be consistent in a model of asynchronously observed Ito processes with 
deterministic correlation, drift and volatility functions in the absence of observation noise and on further 
regularity conditions to be asymptotically normally distributed in lfT6l . 

For a combination of the strategy of the Hayashi-Yoshida estimator with techniques to handle noise conta- 
mination, we use an iterative algorithm introduced in [22} as 'pseudo-aggregation'. Incorporating telescop- 
ing sums there are the following rewritings of the estimator (|7]l: 

_ — - — -{HY) " 

[X,Y]^ ^Y.AXu{Yu^^-Yu_,^_) 

i=\ 
N 



^ (X,, - X J (r^, - n J (8) 

i=l 
N 



with the notion of next-tick interpolated times i^ ^. :— mino<j<TO (tj |tj > ti) and previous-tick interpo- 
lated ones <i._ := maxo<j<TO {Tj\Tj < ti) in the first equality. This rewriting can be as well done in the 
symmetric way. 

The illustration of ([8]l that serves as a basis for the construction of the generalized multiscale estimator 
relies on an aggregation of the observations according to Algorithm [T] This algorithm, which is a concise 
version of the construction in |8 |, stops after (iV+1) < min {n, m) + l steps when all observation times are 
grouped. Summation in ([8| can start with i — or i — 1. In the last equality only the denotation expres- 
sions gi,'-fi,li, Xi are substituted emphasizing that those sampling times obtained by Algorithm [T] can be 
interpreted as previous- and next-tick interpolations again with respect to a synchronous sampling scheme 
Tfe := min(gfc,7s.), 1 < fc < A^, which we call the closest synchronous approximation. Increments in 
(|8]l are taken from previous-tick interpolations at left-end points of instants [Tk-i,Tk], 2 < k < N to 
next-tick interpolated sampling times at right-end points. Since Tj, = max{lk+i, Xk+i), 1 < < 
holds true, we spht the estimation error of (|8]l in two uncorrected parts + with 



— I ptaf aY dt — / pta^ aY dt 

Jo Jt„/\Tm 
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Define t+{s) := mm{U G T^'^lU > «}, t+(s) min{T, G T'^'"|r, > s}; 

t-{s) := ma.x {U E T^-"\ti < s}, t- (s) := niax{T, £ T'^'"|t,- < s}. 

first step: 

• For to < To 50 = t+{To),lo = ^o, 7o = tq, Aq = tq. 

• For to > To 50 =to,lo = ^o, 7o = T+(to), Ao = To. 
ith step (given gi-i and 7i_i): 

• If 5i-i = 7i-i 

- and t+(5i_i) < t+(7j_i) 5^ = t+(r+(7i_i)), = g^^i, 7^ = r+(7i_i), = 7j_i. 

- and t+(5i_i) > t+(7j_i) 5^ = t+(5.j_i), = 5i_i, 7^ = r+(i+(5j_i)), = 7^-1. 

• If 5j-i < 7i_i 

- and7,,_i < t+{gi^i) g., = t+{gi^i),k = g%-i,li = r+(t+(.gj_i)), A^ T"(7i_i). 

- and 7i„i > t+(5i_i) — > 5.^ = t+(7i_i), = 5i_i, 7^ = 7i_i, A^ = (7j_i). 

• If 5i_i > 7i-i: symmetrically as for 5^-1 < 7^-1. 



Algorithm 1 : Iterative synchronization algorithm. 

being the discretization error of a synchronous-type realized covolatility including in general non-observable 
idealized values at the times of the closest synchronous approximation and an additional error due to 
the lack of synchronicity, in particular next- and previous-tick interpolations. The times Tk equal the 
so-called refresh times of |5| and thus our synchronization differs from the one in |5J by replacing pure 
previous-tick interpolation by the above given machinery of previous- and next-tick interpolations. 
The asymptotic theory for the estimator (|8]l as — 00, concisely repeated here, is separately proved and 
presented in a more elaborate way in 

First, we take up the illustrative example from ||8l to motivate the synchronization procedure. For further 
details and examples we refer to |8| and y2!T\. Figure|2]visualizes the aggregation carried out by Algorithm 
[Tjand the times T^, i — 0, . . . , 8 for a toy example. The example emphasizes the important fact that con- 
secutive right-end points of increments can be the same time points. The realized covolatility calculated 
from previous-tick interpolated values to refresh times equals 

{X,, - X,J{Y,, -Y,J + {Xt, - Xt,){Yr, - Y,J + (X,, - X,,){Y,, -Y,,) + 

{Xtg - Xt^){YT-^ -Yt^) + {Xtu-, - Xtg){Yr^g - Yr^) 

and is systematically biased downwards by interpolations, whereas ([H) yields 

(x,3 - x,j(y,, - i;j + (x,3 - Xt,)(Y,, - + {x,, - x,,){y,, - 
{Xt, - Xt,){Yr, - i;j + {Xt, - Xt,){Yr, - y.J + [x^, ~ - 

which is not biased due to interpolations. 

Definition 1 (quadratic (co-)variations of time). For any e IN let t'>'^\ i ~ 0, . . . , iV be the times 
of the closest synchronous approximation and 5^- , 7^^^ , l^^'' , A^^^ the corresponding observation times 
that appear in the estimator (|8]l defined above by Algorithm [T| . T/N is the mean of the time instants 
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Figure 2: Example for non-synchronous sampling design with the sets constructed by the synchi'onization algorithm, interpolated 
observation times occurring in the estimators and the synchronous approximation. 



, N. Define the following sequences of functions 



T 



(10a) 



T'"'<t 



(AT) _ y(JV) 



Ty:'<t 



(lOb) 



for i e [0, T] that we call sequences of quadratic (co-)variations of times. 

A stable central limit theorem for the estimation error is deduced in |7| on the assumption that the se- 
quences defined by ( |10a| l, \\Qh\ and \\Qc\ converge pointwise to continuous differentiable limiting func- 
tions G, F, H and the sequences of difference quotients uniformly. The asymptotic quadratic variation of 
time G of the T^'^^ influences the asymptotics of . The covariation of times measures an inter- 
action of interpolation errors between the two processes and the impact of the in general non-zero 
correlations of the products involving previous- and next-tick interpolations at the same t/^^s for each 
process separately. 

Consider as easiest example the synchronous equidistant sampling schemes with N = n = m and 
t|"'' — rj"'' = i/n, i = 0,...,n. In this case and H'^ are identically zero since interpolations 
are redundant. The function is a step function that will tend to the identity on [0, T] as N 00. 
Then, consider a situation of completely non-synchronous sampling schemes that originates from the com- 
plete synchronous equidistant one by shifting one time-scale half a time instant 1 /2N. We will call this 
situation intermeshed sampling. For this example, the synchronous approximation is still equidistant with 
instants and, hence, G is the identity function. F and H are linear limiting functions with slope 1 and 
1/4, respectively. 

In Q we show for an important special case, independent homogeneous Poisson sampling, that the conver- 
gence assumptions on ( |10a[ )-( [T0c] l are fulfilled when replacing deterministic convergence by convergence 
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in probability. Furthermore, the stochastic Hmits G' {t) , F' (t) , H'{t) are calculated explicitly. 
The main result for the estimator ([8]l is Theorem[2] It serves as preparation to prove the stable limit theorem 
for the generalized multiscale estimator in Theorem[3]and gives insight into the asymptotic distribution of 
(|8]l. For the proof we refer to Q. A similar stable limit theorem for the original Hayashi-Yoshida estimator 
is provided in ifTTI . 

Theorem 2. The estimation error of ([8]) converges on the Assumptions^^and convergence assumptions 
on ( |10a| l-( [T0c| ) and the difference quotients stably in law to a centred, mixed Gaussian distribution: 

'^(Y.i^a.~^h)iY^,~Yx,)-[X,Y]^^ ^N{0, VD^+VA^) , (11) 

with the asymptotic variance 

VD^+VA^ =Tj^^G'{t){a^ajf{pl + I) dt + T (f' {t) {af oj f + 2H' {t){p,a^ ajf) dt 

where the two addends come from the asymptotic variances of and A^, respectively. 

3.3. Hybrid approach to non- synchronous and noisy observations 

In Is) we have proposed the following combined estimation method for the quadratic covariation or in- 
tegrated covolatility from noisy asynchronous observations. After applying Algorithm[T]to the observation 
times, the generalized multiscale estimator is defined by 

j\/jv opt N 

[X, Y]^ = E E (^.f - ij-/r " ^A<^^, J • (12) 
1=1 j=i 

It is a weighted sum of Af^r one-scale subsampling estimators of the type 

sub I ^ / . \ / . \ 



with subsampling frequencies i = 1, . . . , Mjv and optimal weights given later in ( |A.l| i. Owing to the ag- 
gregation of non-synchronous observation times before applying subsampling and the multiscale approach, 
the resulting estimator has a conformable appearance as in the synchronous case (|5a|. Recall that in the 
synchronous setting gj = jj = Tj and Ij-i+i = Xj-i+i = Tj^i holds. 

Choosing Mn = Cmuiu ■ and in = Csub ■ N^^^, both estimators above provide consistent and asymp- 
totically unbiased estimators with convergence rate N^^* and N^^*^, respectively. 



4. Asymptotics and a stable central limit theorem for the generaUzed multiscale estimator 

A comprehensive analysis of the asymptotic distribution of the estimation error necessitates an elaborate 
screening of the conjunction of Algorithm [T| and the joint sampling design (T'^'", T^'™). 
Note that the generalized multiscale estimator ( [T2] i differs from the other plausible Hayashi-Yoshida version 
of a multiscale estimator 

E ^E E (^t<") - ^.<7<'.-., J^--(*n--]T('.-i,.)<--(*r'^-<rL)} - ^^^^ 

which arises as natural Hayashi-Yoshida multiscale estimator when, on the basis of (non-synchronized) 
observations of X and Y, sparse-sample Hayashi-Yoshida estimators are averaged to one-scale subsample 
estimators and those extended to a linear combination using different time lags. We state without proof 
that this estimator is consistent, asymptotically unbiased and will attain the optimal rate of convergence. 
Nevertheless, we benefit from the data aggregation method and applying subsampling to the synchronized 
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relations X 




52 = ^3 


53 = ^5 


54 = ^6 


55 = ^7 


56 = '7 


57 = ^8 




relations Y 


71 = A2 


72 = A3 


73 = A4 


74 = A5 


75 = Ae 


76 = As 







Table 1: Allocation of sampling times to cases ® — (3) for the example. 



scheme, since the variance of our estimator ( |T2] i is smaller than the one of this alternative estimator and we 
are able to find a feasible closed-form expression of the asymptotic variance. 

The crucial difference between both approaches is that for the alternative method next- and previous-tick 
interpolation errors take place on sparse-sampling time intervals in average of order i/N whereas the 
interpolation errors of the generalized multiscale estimator ( [T2] l take place on the highest-frequency-scale 
and hence on intervals in average of order In particular the decomposition 

X IN) — X (N) = (X IN) — Xrp IN) +Xrp (N) — X (N) + X (N) — X (N) ) 
^ ^ ^ V ' ^ V ' " V ' 

=Op(Af-i/2) =Op((i/Ar)(i/2)) =cij,(Ar-i/2) 

of the increments of X and analogously for Y, give an heuristic that the interpolation errors driving the 
error due to non-synchronicity asymptotically not affect the variance of the signal term. The stochastic 
orders are given for times instants of average order N^^. 

For a rigorous clarification of the asymptotic error due to noise and the cross term, both influenced by the 
i.i. d. observation errors at times g,, l^, ^i,Xi, we figure out the times gi = gi^i and the right-end points 
t/l''^'' = l^^l , g'^'' = that are as well preceding left-end points and analogously for the sampling times 
off. 

All observation times 7^, are characterized through one of the following four mutually exclusive cases. 
Denote 7^ _ the last observation time of Y before 7^ and 7j__|_ the first one after jj. We illustrate the 
allocation of the observation times for 7"^^'" and 7^ , j = 1, . . . , — 2: 

® 7j < 9] ^ 7i ^ 7j+i , 7j = Aj+i , 7j ^ , 

@ 7j > 9j , 1] > 9j,+ ^ 7j = 7j+i , 7j 7^ A^+i , 7^- = Aj+2 , 

® 7j > 5j , 7j < 9j,+ , 7j\+ > 9].+ ^ Ij + 7j+i , 7j + Aj+i , 7^ = A^+a , 

® 1] > 9j , 1] < 9j,+ , 7j\+ < 9].+ Ij + 7j+i , 7j + Aj+i , 7^ 7^ Aj+2 , 7j,+ = Aj+2 . 

Only sampling times distributed to case @ lead to repeated 7^ = 7i+i. In cases Q, @ and (3) a subsequent 
left-end point Afc,fc = z + lorfc = i + 2of observation time instants incorporated in the subsampling 
estimators is designated by 7^. All other A^, k ^ 2, . . . , N appear in an allocation of sampling times of the 
type (3), where Aj-1-2 = 7j,+ 7^ 7; VZ. Recall that A^ 7^ A^ for all z 7^ fc holds true. 

If holds for 7j with fixed j e {1, . . . , - 2} and if fc := a.rgmmke{j,...,N~i}{lk > 9k,lk> 9k,+) 
exists, then holds necessarily for one 5;, ^ e {j + 1, . . . , fc — 1} or (7; = 7;. 
In Table[T]we list the relations for the sampling design of our previous example. 

Assumption 4 (asymptotic quadratic variation of time). Assume that for the sequences of sampling 
schemes and the times T^'^'^ of the closest synchronous approximations and for the sequence of quadratic 
variations of time (t) defined in Definition^ the following holds true: 

(i) (t) — >■ G{t) as N ^ oo, where G{t) is a continuously differentiable function on [0, T\. 

(ii) For any null sequence (/lAf), — O (A^~^) 

uniformly on [0, T] as N ^ oo.. 
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(Hi) The derivative G'(t) is bounded away from zero. 



Definition 2 (degree of regularity of asynchronicity). For iV e ]N and sets W ,Q\i = 0, . . . , iV con- 
structed from aggregated sampling schemes q'^^n q-Y,m fulfill Assumption [2] define the following 
sequences of functions: 



7r'<* 

which describe the degree of regularity of asynchronicity between observation times 7""^ " and T"^ ™. 

In the completely asynchronous case, we can directly conclude that — ly {'^)\ ^ T/N for all 

t e [0, T] and one sequence suffices to reflect the regularity of the non-synchronous sampling schemes. 

Assumption 5 (asymptotic degree of regularity of asynclironicity). Assume thatfor the sequences of 
sampling schemes and for the sequences of functions , ly defined in Definition^ the following holds 
true: 

(i) Ix{t) Ix{t)i ly i^) ~^ ^vit) as N 00, where Ix{t), Iy{t) are continuously differentiable 
functions on [0, T]. 

(ii) For any null sequence (hpf), ~ O (N^^^ 

llN 



n-N 

uniformly on [0, T] as N ^ 00. 

For both, synchronous and intermeshed sampling which have been introduced in the last section, the 
sequences of functions ly are identically zero. The functions defined in Definition |2] are non-negative 
and bounded above by 1. In Section |6] we explicitly deduce the asymptotic degree of regularity of asyn- 
chronicity for mutually independent homogeneous Poisson sampling schemes. The term (asymptotic) de- 
gree of regularity of asynchronicity has been chosen since Assumption [5] holds for all non-degenerate se- 
quences where observation times conforming to one of the cases Q — @ from above tend to be distributed 
according to some regular pattern and it gives information on the interaction of allocations of observation 
times. 

It is interesting and might seem surprising at first glance that the asymptotics of the estimator ( [T2] i hinges 
on this asymptotic feature whereas, as indicated before, the asymptotic interpolations to the closest syn- 
chronous approximation are asymptotically immaterial. This circumstance is caused by the fact that for the 
construction of an estimator with Algorithm[T] as for the original Hayashi-Yoshida estimator ([H), observed 
values of the processes at next-tick interpolated observation times can appear twice. If there is observa- 
tion noise, the number of observations allocated conforming to case @ has an impact on the asymptotics. 
The influence of interpolations is asymptotically vanishing for the combined method in contrast to the 
estimator (|8]l with faster convergence rate ^/N since interpolation steps take place on the time-scale of 
high-frequency observations, but lower-frequency sparse-sampled increments of the synchronous approxi- 
mation are involved to reduce the error due to noise. We continue with the central result of this article: 
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Theorem 3 (Central limit theorem for the generalized multiscale estimator). On the AssumptionsU\ |2] 
jij j^jflnt/jjj the generalized multiscale estimator \12) with noise-optimal weights Q;°^^ — (12i^/Afj^ — 

{6i/Mff) (1 + 0(1)), that are explicitly given in ( |A.1[ ), and = Cmuiu • converges J^— stably in 
law with optimal rate N^^* to a mixed Gaussian limiting distribution: 

/ multi \ f 

N'l' f [X, Y]^ - [X, Y]A^n (0, AVARwtO 
with the asymptotic variance 

AVAR,„„,,, = c„t;*. (24 + 12 (/^(T) + Iy{T))) vWy +Cum 

=AVAR,„„„, 

26 

+ Cmuiu^T G'{t){a^aYY{l + pi)dt (18) 
35 ./n 



=AVARd 



=AVARc 



The weak convergence is proved to be stable with respect to the cr-algebra T associated with the ef- 
ficient processes. As a side result, we also obtain a stable central limit theorem for a simpler one-scale 
subsampling estimator: 

Corollary 4.1 (Central limit theorem for the one-scale subsampling estimator). On the Assumptions 
the oTie-scQ.le subsGffipling estiTnutor with sub SQ-inplifig frequency — Csub 

■ N ' ^ converges 

J^-stably in law with rate N^^'^ to a mixed Gaussian limiting distribution: 



4N(0,AVAR,„b) , (19) 



with the asymptotic variance 

AVAR,„b = c;„2, 477i77f, + c.^b \t G' {t){a^ aj f {I + pi) dt . (20) 



=AVAR„ 



=AVARd 



For the proof of Theorem|3] we split the total estimation error of the generalized multiscale estimator in 
three asymptotically uncorrected addends due to noise, cross terms and the signal term. For the one-scale 
subsampling estimator we follow the same ansatz. The orders of the errors have been derived in ||8j| and we 
focus on the asymptotic distribution here. 

The error due to microstructure noise of the one-scale subsampling estimator has expectation zero and the 
variance yields 



ANi-^^r,lr^l+o{NiJ,^) , 



since observation noises of X and Y are independent of each other by Assumptionjsjand Zj. 1^ for k ^ r, 
Xk ^ Xr for k ^ r and if ~ 9k+i ^ Ik < 7fe+i , < fc < (A^i). Hence, the error due to noise is a 
sum of uncorrected centred random variables with equal variances and the standard central limit theorem 
applies. 

For the generalized multiscale estimator, we further decompose the error due to noise in a main part of order 
N^^'^M'^^^^ and two terms due to end-effects of orders M^^^^, where all three terms are asymptotically 
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uncorrelated. In Propositions ! A.l| and | A.2| we prove central limit theorems for these terms. Asymptotic 
normality holds both, conditionally and unconditionally on the paths of the efficient processes. 
The error due to noise of the one-scale estimator does not depend on any further influence of the sampling 
schemes except the number of constructed sets N and G'. Cross terms are, in contrast to the multiscale 
case, asymptotically negligible since 



E 



N 



'AT 



3=tN 



N 



9j 



7j 



(aff dt 2Tfy + / {ajf dt 2ry| + o {ij,^) 



Oii 



N 



o(l) 



— 1/2 

For the generalized multiscale estimator instead the cross terms are of order Afjy and will have effect 
upon the asymptotic distribution. In Proposition | A.ll| a limit theorem is stated where the weak conver- 
gence also holds conditionally and unconditionally on the paths of the efficient processes. The asymptotic 
variance AVARcross includes the influence of the asymptotic degree of regularity of asynchronicity. 
The error due to discretization of the one-scale subsampling estimator yields: 

1 ^ 

1 ^ 

1 ^ 



7E( E ± Ar.,)-[x,r], 



j—i k—j—i-\'l 
7V-1 



fe=j-i+i 



fc— j — i+1 fc— J — -i+l 



7E( E 

j—i k—j—i+1 l^r 

l,re{3-i+l,...,j} 



N-1 



k=j-i+l 



k=j-i+l 



^^T,,\ E (^T.-lA. + j)+AyT,,,( E (^T.-^W.; 
A;— J — z+1 k—j—i-\-l 
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We have written the overall discretization error of the one-scale estimator as the sum of a discretization 
error of the closest synchronous approximation 

E (^^^. 1^ " It (' ~ 5) ^^'^'-) ^'^''^ ^^^^ 

and the asymptotically negligible error due to the lack of synchronicity. A stable central limit theorem 
using the theory of ifTSl for the leading term of order il'^N^^^'^ that will drive the asymptotic distribution 



is postponed to Proposition A. 4 The error due to asynchronicity is treated in Proposition A. 10 
The discretization error of the generalized multiscale estimator is of order il/^^iV^^/^ and that of the one- 
scale estimator of order il^N~^/^. There is a trade-off between the error due to noise and the discretization 



error for both estimators. For the generahzed multiscale estimator these are of orders N^/^Mj^''^ and 



,-3/2 

'n 

Afj^^A^^^/^, respectively. Remaining other terms are of orders Thus, choosing M^r = Cmuiti' N^^^ , 

the total estimation error is minimized and of order M^^^ — N~^^^ which constitutes the optimal rate of 
convergence in Theorem [3] 

The weak convergence of the discretization error is proved to be stable, so it converges jointly in law with 
every bounded J^-measurable random variable defined on the same probability space. Since the asymptotic 
normality of the cross term and the error due to noise holds both, conditionally and unconditionally given 
the efficient processes, and the discretization error is independent of and we can apply a central 
limit theorem for mixing triangular arrays as in f28l to the sum that is adapted with respect to Aj — 
a (e^ , t/c < Tj+i , , Tfe < T^+i , Tt^ ) where T is the cr-algebra associated with the efficient processes. 
The asymptotic variance is the sum of those of the uncorrelated addends. With the Cramer- Wold device 
joint normality and asymptotic independence of the different errors can be concluded. 



This is likewise for the one-scale estimator and Corollary 4. 1 Choosing the subsampling frequency i]^ — 
Csub ■ N'^^^ balances the variance of the error due to noise which is of order Ni^"^ and the discretization 
variance of order iN~^. 



5. Asymptotic variance estimation 

The asymptotic variances ( [T8] l and (|20| of the generalized multiscale estimator ( [T2] i and the one-scale 
subsampling estimator ( [T3| l, appearing in the stable central limit theorems in Theorem[3]and Corollary |4.1[ 
are random and depend on unknown quantities. In this section, we aim at estimating these asymptotic 
variances consistently to make our limit theorems feasible. 

It is a known result that a consistent estimator of the noise variances is given by ( |24) l (cf. ||3T1 ). Furthermore, 
the estimators for 77^ and yyy are asymptotically uncorrelated on Assumption|3] since the uncorrelated noise 
terms dominate the correlated Brownian parts. The constant Ix{T) + Iy{T) in the noise part of ^TE\ can 
be estimated with the empirical version Ix{T) + lyi^) that converges as TV ^ 00 on Assumption [s] 
Eventually, consistent estimators for the discretization variances and the variance due to cross terms for the 
multiscale estimator are required. 

We propose histogram-type estimators using bins according to timescales associated with the quadratic 
variation of synchronized sampling times and associated with the degree of regularity of asynchronicity, 
respectively. For this purpose, given a chosen number of bins ifjv, with Kj^ — > 00 and K^^N — > 00 
as 00, we determine the assigned non-equispaced bin-widths AGj^ = Gj^ — Gf-i, A(/x)^ — 

(Ix)^ - and A(/y)f = - „ J &{!,..., Kj,}, where 

Gf inf {t e [0,T] | G^(t) = (N/T) ^ (AT^^y > {j/K^) ■ G^(T)} , 

j e {1,..., Kn}, and analogously for the functions and if (T) > and (T) > 0. Set 
Gq — {Ix)o — {Iy)o ■= and recall that those functions are monotone increasing on [0,T]. On each 
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bin we calculate multiscale estimators in the same spirit as jT2|ljind its univariate version from ll30l for the 
increase of the quadratic (co-) variations that are denoted A [Xj^jv, A [y]g,jv, A [X, Y]qn, A [XJ^^j^^n 

and A in the following. The underlain idea is to approximate the continuous random processes 

{cr^ ptY , [c'^y' and (cr^)^, or rather their time-transformed versions, by locally constant functions. 
This construction leads to time-adjusted histogram estimators 

] ^ for I G'(t){a^aUtfdt, (22a) 

for ( G' {t){a^ f dt, (22b) 

"'0 



AGf 


/ Kn 


A[X]c„A[r]cf \ ( 


(AGf)^ j 




1 i^m 








1 I^iT) 


A(/x)f 


' Kn 



Kn 

^3 = > : I -TTT^ 1 ^IF^ for r dt, (22c) 



for / I'x{t){a(f dt. (22d) 
Jo 



Proposition 5.1. The asymptotic variances ( |18| l and \2Q) of the generalized multiscale estimator \12\ and 
the one-scale subsampling estimator \\3) with Mn — CmuiuN^^^ ond in ^ CgubN^^'^, can be estimated 
consistently by 



T 



CrauiJ^T (A + h) + c-li,}^ {^{l + h) + ri(l + /4)) , (23a) 



AVARsub = c;„',47^?| + csub^ (/i + 12) , (23b) 
where are the estimators ( |22a[ )-( |22d) t and 

n m 

i=i j=i 

Remark 1. Convergence rates of the estimators ( |23a| i and ( |23b| l for the asymptotic variances depend on the 
smoothness of ct^, <7^ and p. For current stochastic volatility models as the Heston model, they are N^^^- 
consistent when choosing Kn — ckN^^^ for a constant ck and Mn ^ N^^^ for the binwise multiscale 
estimators. 

In the absence of noise, a consistent estimator for the asymptotic variances 2T G'^{t)(af)*dt of the 
realized volatility has been proposed in IS] as (2n/3) J^^^ii^-^tJ'^- In the bivariate synchronous setting 
(n/2) '^"^li^Xf^)'^ ((AFt.)^ + (AYj-^J^) is a convenient estimator. Consistency can be proved with 
Ito's formula and partial integration and comprehended by the analogy to a bivariate Gaussian distribution 
{X,Y) ^ N(0, S) with a covariance matrix E with entries cr^, cry , pcrjf cy. Then, EX"^ — 3(T^ and 
E [X^Y^] = 2p2cr^CTf- + fT^fTf, hold true. 

In the noisy case smoothed versions of the estimators (using multiscale or alternative methods) are adequate 
(cf. ||9l)- However, in the non-synchronous non-noisy setting, there is no direct extension available and for 
that reason we have made the effort to construct the consistent histogram-based estimators (|22a|-(|22d| 
above. 
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6. Discussion and application 



6.1. A case study 

We have learned by now that in a synchronous setting the special version of the central Umit Theorem 
[3] from Proposition 3.1 holds true. Since asymptotics of the estimator ^V2\ not hinge on interpolations in 
the signal term the same central limit theorem applies in the case of intermeshed sampling introduced in 
Section [3] Now we focus on observation schemes that arise as realizations of two homogeneous Poisson 
processes that are mutually independent and independent of the processes X and Y . Although this model 
can be criticized for its flaw that sampling schemes of two correlated processes follow two independent pro- 
cesses and time homogeneity, what might seem to be unreaUstic in financial applications, independent and 
homogeneous Poisson sampling constitutes the most commonly used model in this research area (cf. |[30l . 
iTiSJ among others) and appertains to show that the general form of ( fTSj ) is tractable. 
Letn(")(t)andTO(")(t) be sequences of two independent homogeneous Poisson processes with parameters 
Tn/Oi and Tn/6'2 (n € M), such that the waiting times between jumps of n' "' and m'") are exponentially 



distributed with expectations E 



in) 



Oi/n andE 



Ar 



(n) 



02/n , « e M, j e E^. In this case 



AT, 



(n) 



F{t) = 1 - cxp - 



tn 



exp 



tn 
'O2 



■ exp —tn 



1 

O2 



,k e 



hold true and we derive the following Poisson sampling version of Theorem[3] 

Corollary 6.1. On the Assumptions^and^ the generalized multiscale estimator ( |12| ) with noise-optimal 
weights ( |A.l| l, and = c„iuiti ■ V'^, converges conditionally on the independent Poisson sampling 
scheme with < 0i < 00 and Q < 62 < co stably in law with rate N^^'^ to a mixed normal limit: 

. / multi \ / 

N'/^ [[X, Y]^ - [X, Y]^j 4 N (0, AVAR^„°-,^ 
with the asymptotic variance 



% "poiss -3 f A o 2^]^ ^2 (22 1 

^multi ~ ^multi I 24 + 12 ^^^~7j7^^ ) ^xVy + ^rnulti 



5 



The asymptotic variance of the N^^*^ -consistent one-scale estimator becomes 



2 /"^ 26'?6i2 



The order for the supremum of time instants in Assumption l2] holds in probability and the proof in 
Appendix A stays valid. A Poisson sampling version of Theoremplis given in IT), where the stochastic 



limit of (J^ is deduced using the distribution of a maximum of two exponentials stated above. 
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6.2. A bridge between the noisy and the non-noisy setup 

So far we considered noise variances not dependent on N, but from an applied point of view there 
is interest in the case where the noise level may vary with N ^ n ^ m. The primary motivation to 
accommodate dependence of the noise on the sample size in the model originates from the economic 
background. Empirical studies of (ultra) high-frequency financial data suggest to rather model the observed 
log-prices as sum of a latent semimartingale and noise for that the variance decreases in N as reported in 
II20I and among others. 

If an estimation approach uses previous-tick interpolations, as the one proposed in |5 |, these methods are 
not accurate for that setting any more. This also becomes apparent in the simulation study in |8 1 when the 
performance of the estimators is compared for varying noise levels. The generalized multiscale estimator 
is not biased due to asynchronicity and passes over to the Hayashi-Yoshida estimator (|8]l for Mm = 1, a 
^/N -consistent estimator in the complete absence of noise. For that reason, our estimation method achieves 
an improved convergence rate in the model with decreasing noise variances. The next Corollary is obtained 
by a direct extension of the proof of Theorem [3] in [Appendix A| when replacing the moments of the noise 
processes. A similar extension for the one-scale estimator where we obtain the rate s + t for a subsample 
frequency = Csub-^^'^ holds analogously. 

Corollary 6.2. Consider the model imposed by Assumption^^^and^ but with noise variances r]^ (N) — 
CxN~°' , r/Y — CyN^" , < a < 1 and constants < < 00 , < < 00. The generalized 
multiscale estimator \\2\ with Mn = CrnuitiN2~^ and optimal weights ( |A.l| l converges stably in law to 
a mixed Gaussian limit: 

m + ^ i^[X, Y]^ - [X, Y]^j 4 N (0, AVAR:„„;,J (28) 

with the asymptotic variance 

AVAR* -3 fo^ ' ,n IxiT)+lY{T) \ ^ UCxCy 

AVAR^„,,, = c,„„;,^ I 24 + 12 1 (xCv + c„^um 1 

26, '■^ 



5 



Jo 



Incorporating a pure previous-tick interpolation strategy as in fS), one cannot gain an improved conver- 
gence rate in that setting due to the bias by non-synchronicity effects. 

6.3. Application 

6.3.1. Note on modeling assumptions 

For an application to financial time series data the conditions imposed by Assumptions [T]|3] seem to be 
restrictive and the model will not describe stylized facts of the data in an adequate way. In particular relax- 
ing the i. i. d. assumption on the noise is important. On the other hand, the underlying model in the sections 
before is convenient to establish an asymptotic distribution theory and ascertains a closed-form expression 
for the asymptotic variance. Yet, the generalization for serially dependent observation errors as carried out 
in 111 for the one-dimensional case is possible for the generalized multiscale estimator ([T2| as well. On 
the assumption of stationary strong mixing noise processes the multiscale estimator remains consistent and 
rate-optimal without any further adjustment. The analysis for the synchronous case can be adopted from 
||2|, but for the general non-synchronous noisy setting a closed-form expression of the asymptotic variance 
and a corresponding limit theorem is not available. Furthermore, the condition that noise processes are mu- 
tually independent can be relaxed if one wants to allow some correlation E ej^e^ = rfxY '^j 
located near each other. In any case the generalized multiscale estimator remains asymptotically unbiased 
and Af^/*-consistent. This does not necessarily hold true for the one-scale estimator that would have to be 
bias-corrected as the TSRV estimator by 1,31 J in the one-dimensional case. 
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More general efficient processes including jumps can be covered in the model when we combine the method 
to a two-stage approach as presented by 111] for the one-dimensional estimation approach. Considering 
semimartingales 



Jo Jo Jo Jo i^-^ 



with locally bounded drifts, continuous volatilities and counting processes , counting the jumps 
of X and Y with jump sizes L^, / — 1, . . . , and Lj , I = 1, . . . , JY , respectively, the generalized 
multiscale estimator converges in probability to the total quadratic covariation 



T 







0<s<t 



where AXs — Xg ~ ^s,-, AK, — Ys — Ys_^, and the second addend is the sum of the simultaneous 
co-jumps. When one is interested in disentangling the continuous part from the jumps, one convincing 
possibility following 111] is to use wavelet methods that locate the jumps in the sample paths, estimate 
jump sizes and afterwards use our estimation approach for the validated observations. 
In conclusion the generalized multiscale estimator ( [T2] i is capable for usage in various applications. In 
||8], we have approved that the estimator performs well and has satisfying finite sample size features in 
simulations including serial dependent noise and typical stochastic volatility models. 

6.3.2. Choice of tuning parameters 

An implementation of the estimation approach requires first a rule to choose tuning parameters. We 
provide an accurate algorithm to implement the estimators and to obtain estimates for their asymptotic 
variances. One plausible selection of the constants Cmuiu — VN /Mn and Cgub — N^^^/^n can be 
derived as solutions of the minimization problems of the asymptotic variances. This leads to formula ( |29] l 
in Algorithm |2] 

The tactic of Algorithm|2]is the following: Evaluate a pilot estimate c^^^j^ j for Cmuiu as solution of formula 
( p9l l inserting a sparse-sampled estimator for the signal term. Then set up the estimation of the asymptotic 
variances involving the estimators ( |22a| i-( |22d] i. Take = c^^uiu^-^-^N fixed for the multiscale esti- 
mators on all bins and set = c\^uiii\/ NckN^/^ where ck = Kjj^N^/^. This selection is optimal 

m 

5/4 



for common volatility models. We obtain c'^^iu'^K^^ ~ '^muiu from the orders of the different errors 



of the histogram estimators = cl^ . Hence, = (c|^^;j,) and ck = \p^u is derived. 

Using estimators ( |22a| i-( 22d| i, we calculate estimates for the addends of the asymptotic variance and Cmuiu 



according to formula ( |29l ) again and Mn — \cmuitiy is used for the final estimator. It turns out that 
this strategy it quite robust to the a priori chosen sparse-sample frequency that can be chosen under the 
impression of usual diagnostic tools as signature plots and acfs. 

6.5.5. Simulation and data analysis check 

As completion to the detailed simulation study in the supplementary material to fSl we investigate the 
performance of Algorithm |2] and the histogram-estimators ( |22a| l-( |22d| l here. For this purpose we simulate 
from an simple Brownian motion model with zero drifts and constant volatilities ct^ = = 1 and p — 
1/2 with equal noise variances rj^. Sampling schemes are generated by independent time-homogeneous 
Poisson sampling with 30.000 expected observation for both processes on [0, 1]. Results of the estimates 
are listed in Table [2] In addition, our approach is tested in an application to EUREX future tick-data 
taken from a database provided by the Research Data Center (RDC) of the CRC 649 'Economic Risk' in 
Berlin. We aim at estimating integrated covolatilities between the four financial securities with the highest 
tick-frequencies in the database. These are the Euro-Bund Future (FGBL), that is based on a notional 
long-duration debt instrument issued by the Federal Republic of Germany, the Euro-Bobl Future (FGBM), 
a likewise medium-duration contract, and futures on the EURO STOXX 50 (FESX) and the German DAX 
(FDAX). 
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• Choose a priori L and calculate pilot estimator AVAR^^^jj with 



AVARd,,=- 

fc=i 



and [X]^ = Y.,=kL.k>i {^t, - Xt^^^) and [F]^ analogously and = (T) and /^(t) = 

ly {T). Calculate rfy. and ?7y according to ( |24] i. 
• Use pilot estimates to estimate optimal constant(s) 

^ -1/2 

/ -avar!!,.„,.,„ + ,/f avar!!_,, „V + i2av^!J,,,avar!' \ 







V 1 


^ -^^^-^ -f^ c r o s s , n 


+ 12AVARd,^AVAR„ 



6AVAR^ 



(29) 



andcil = ^2AVAR^^,„,/AVArL^,„,. 

• Calculate Ii — I4, given in ( |22a[ )-( |22d] i, with 

Kn = V^iVV^ bins and M^{j) = (cl^^,) '^V^/^ V^' . 

• Estimate asymptotic variance with Ii — I4 and 77^ , rfy and determine Cmuiu and c^uh with the above 
given formulae. 

• Calculate the generalized multiscale estimator ( [T2] l with optimal weights jA.l) (and the one-scale 
subsample estimator) with Mjv = CmuiuV N (and ijy = Csub^^^^) 



Algorithm 2: Algorithm for the estimation procedure. 



We apply the procedure with Algorithm [T] and Algorithm[2]to a suitable filtered dataset and give the results 
for two days in Table [3] with associated estimated optimal multiscale frequencies. The estimates for the 
quadratic covariations are given ± estimated standard deviation from the estimated asymptotic variances 
and the pertaining N for each pair. Although there are characteristics of the dataset not in accordance with 
the model assumptions, above all price discreteness and the fact that most returns are zero, the estimation 
approach passes this intuition check. Since the ESX and the DAX share 13 companies constituting c. 28.5% 
weighting in the ESX and c. 72.4% in the DAX there is a big systematic positive correlation between both 
and we presume that there is as well a high correlation between the two debt instruments which is both 
revealed by the estimates. On 09/11/2001, there has been an tremendous impact so that FGBL/FGBM 
have increased and the FESX/FDAX decreased. For that day we have significantly negative integrated 
covolatilities between debt instruments and stock indices which is not the case for the common trading day 
in comparison. As answer to the great amount of zero returns the only adjustment of the method that we 
undertake is to estimate noise variances in (|24]i by dividing the realized volatility by twice the number of 
non-zero returns instead of all returns. 

To sum up, the estimation approach based on Algorithm [T] a multiscale extension of subsampling and 
Algorithm |2] provides a convincing method to obtain integrated covolatility estimates for very general 
high-frequency data. 
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Table 2: Estimators j22aj-j22dj, estimators for the asymptotic variances of the multiscale jTSj and the one-scale estimator \20\ , 
calculated asymptotic variances and estimates for the quadratic covariation. The estimates are given it empirical standard deviations. 



noise var. rj 



0.0001 



(0.001/VlO) 



0.001 



(0.01/VlO) 



0.01 




0.392 ±0.038 
1.557 ±0.067 
0.250 ±0.007 
0.250 ±0.007 
0.090 ± 0.003 

0.0663 
0.0086 ± 0.0002 

0.0077 

0.501 ±0.024 
0.500 ±0.022 



0.390 ±0.047 
1.552 ±0.085 
0.249 ±0.010 
0.249 ±0.009 
0.143 ±0.005 

0.1185 
0.017 ±0.001 

0.0166 

0.499 ±0.029 
0.499 ±0.028 



0.394 ±0.073 
1.538 ±0.141 
0.249 ±0.016 
0.249 ±0.016 
0.246 ±0.015 

0.2159 
0.037 ±0.002 

0.0357 

0.498 ±0.038 
0.499 ±0.042 



0.413 ±0.144 
1.529 ±0.276 
0.247 ±0.031 
0.247 ±0.030 
0.434 ±0.050 

0.3774 
0.080 ± 0.009 

0.0768 

0.499 ± 0.049 
0.500 ±0.058 



0.423 ±0.128 
1.462 ±0.230 
0.234 ±0.085 
0.233 ± 0.083 
0.778 ± 0.082 

0.6737 
0.157 ±0.017 

0.1656 

0.501 ± 0.065 
0.503 ± 0.074 



muLti 

(MSFR) 


FGBL 


FGBM 


FESX 


FDAX 


FGBL 


5.46 ± .87 (7) 


3.32 ± .74 (7) 


0.98 ± .99 (9) 


0.52 ± .74 (6) 


FGBM 


r 


2.42 ± .47 (6) 


0.78 ± .69 (7) 


0.64 ± .51 (4) 


FESX 






68.26 ±7.1 (10) 


29.39 ±2.4 (7) 


FDAX 








61.43 ±3.5 (5) 



- — ■ — ■ — - niulti 

[X, Y]j, (MSFR) 


FGBL 


FGBM 


FESX 


FDAX 


FGBL 


27.89 ±4.1 (15) 


12.94 ± 1.7(12) 


-52.55 ± 18(8) 


-34.13 ± 11 (7) 


FGBM 




18.10 ±2.5 (8) 


-26.44 ± 14 (6) 


-25.01 ±28 (3) 


FESX 






3070 ± 172 (6) 


757 ± 15 (4) 


FDAX 








1870 ± 94 (4) 



Table 3: Estimates for integrated covolatilities, standard deviations (■ 10*') and used multiscale frequencies for 10/01/2008 (top) and 
09/11/2001 (bottom). 



Appendix A. Proof of Theorem |3] 

Appendix A.l. Error due to noise and choosing the weights 

The error due to microstructure noise of the generalized multiscale estimator is given by 



E ^ E (^^ - <-..0 « - = E ^ (E {4^1 + 4^) 

2—1 j—i i—1 J — 1 

N i-1 N 



J=1 



3 = 1 



Additionally to the standardization condition 



E O^iMN = 1 ' (CI) 
1=1 

that is necessary for asymptotic unbiasedness and consistency, we now impose the auxiliary condition 



E 

i=l 



, 



(C2) 
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on the weights which assures that the leading term of the noise error equals zero. Hence, there remain three 
uncorrelated addends in the error induced by microstructure noise. 

Proposition A.l. Let Assumptions^and^on the observation times and Assumption^on the observation 
errors hold true. The asymptotic variance of the term 

Mjv N 



2^ i v'^ff.%-.+i 



is minimized by the weights 

opt _ ( 12P 6z 6i \ 12i'^ 6i 



(l + o(l)) (A.l) 



as Mjv, N oo and Mjy /N with N — o (M^). The following asymptotic normality result holds 
true: 

, ,3 / Mm opt N 

^ ( E ^ E {4^.-... - <C ) I - N (0 , AVAR_) , (A.2) 

with the asymptotic variance 

AVAR„„„, = (24 + \2{Ix{T) + lY{T)))vjcVY (A.3) 

with the functions Ix and ly defined in Assumption^ The weak convergence also holds true conditionally 
given the paths of the efficient processes. 

Proof. The term is centred and we illustrate it in the way 

M„ N N MmM 



E <^i,MN ( X Y _^^Y X A _ \^ \^ Ctt.MN fx Y 

2—1 j—i J — 1 ^—1 



For fixed i the addends of the inner sum are uncorrelated because li 7^ Ij and Aj 7^ Xj for all i j. 
Consecutive right-end points gi, % can be the same observation times instead, so that the inner sums are 
2-dependent random variables. Thus, the variance is given by 



N Mm A] 

C^iMn ( — fY 

, j=i ^ i=i 



Var E ( 



N A/jvAj N (A/jv-l)A(j-l) 

(<^i,MN\ o 2 2 I OtiM„aij^i,MM 2 2 /i n \ 

E \r^) ^"^xVy+Z^ 2^ JT-TY) '?A-'?y(l-lte#ff, + i,7.#7. + i}) • 

j = l 1=1 j=l i=l ^ ' 

The weights that minimize the first addend of the above variance and also the total variance asymptotically 
have been determined in (SI. Those weights are in line with the standard weights from ll30l in the univariate 
setting and correspond to a cubic kernel for the kernel estimator by |5 1. 

Inserting the noise-optimal weights ( |A.l| i, we can apply a central limit theorem for strong mixing triangular 
arrays from ESl to 

3 N opt A/jvAj 
^^N \ " "»,A/w ( fX f"^ fX 

^ 2^ i \ 9j >'i~^ + l^ l3-i^l l3 

j=l t=l 
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The sequence of variances with the chosen weights according to jA.l) 

r3 ^ AfivAj" 



Var 



N ^ i 



V^^^Mjv ( X Y , X 

2^ i 2^ v'^ff/A.-.+i 



3 JV Af„AO + l) / opt \ 2 
J=0 i=l \ I 

m7^\T)\ - 12(1 - (/x(T) + lYimvWv 
converges to AVARnoise on the Assumption|5] 

Since the inner sums are 2-dependent and hence in particular (/)-mixing, the Lyapunov condition that holds 
obviously suffices to apply the central limit theorem of ||28| . This completes the proof of the proposition. 

□ 

Next, we consider the remaining addends of the error induced by microstructure frictions and insert the 
weights (|A.l|i: 



Proposition A.2. On the Assumptions^ |5]ant/|5] the following weak convergence to a centred normal 
distribution holds true: 

( Mm opt A-1 N \\ , . 

E4<+ E « N N (O, ) . (A.4) 



z— 1 

This convergence also holds conditionally on the paths of the efficient processes. 
Proof 

E^Y^ E'^^'^7, + E ^h'^^A E ('^^^7, +^^^-/L-J E 

1=1 \3 = 1 j = N^i+l I 3 = 1 i='J + l 



Both addends are uncorrelated and treated analogously. We restrict ourselves to the proof for the first term. 
\/Mn ^fj^ij '^i=^+i '^Tmi^I^ endpoint of a discrete centred martingale with respect to 

the filtration := a (ej^ \tk < gj , Xt^ \0 < k < n) V a (e^^ jr^ < jj , YrjO < k < ni) . Namely, since 
9] = 9j~i ^ 7i > 7j-i and 7^ = jj^i ^ > g^^i analogously: 



E 



Mn opt 

A central limit theorem for martingale triangular arrays from llT4l is applied, in particular the non-stable 
version of Corollary 3. 1 (cf the following remark in (14] and references cited therein). The conditional 
Lindeberg condition can be verified by the stronger conditional Lyapunov condition. The proof of it is 
obtained by a similar calculation as the following one and we omit it. The conditional variance equals 

2 



A/jv-l / Mn opt 

E E 




X 1 Tfx-nl 



l{g,5^ff,_i,7,5^7j_l} + + («7,) l{7,=7,-l}^i 



6 2 2 

^VxVy 



We have used the formula E^i" ' (E,=j+i MN{a°%Ji)) = (6/5)M^i + o(A/^^). 



□ 
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A.2. Discretization errors of the estimators 

Proposition A.3. On the Assumptions^ ^and^ the discretization error of the one-scale subsampling 
estimator with subsampling frequency i n converges stably in law to a centred mixed normal limit as in ^ 
oo,N oo,iAr/iV" Qforeverya > 2/3; 



In 



N 



{Xg, - {Y^, - 1a,_,+0 - [X, r]^ 4 N (0, AVAR,,,^,„f,) 



with asymptotic variance 

AVARd^s,sub =It I G'{t){afaYf{l + pi) dt . (A.5) 
-J Jo 



Proposition A.4. On the Assumptions 7|2 and^ the discretization error of the generalized multiscale 



estimator with the noise-optimal weights given in ( |A.l| l converges with rate ^ N /Mjq stably in law to a 
centred mixed Gaussian limit as Mn — oo, N — oo, M^/N" Ofor every a > 2/3; 



-J^ / Mn ^opt N \ 

_ ^ ^ ^ ^^^^ _ ^y^^ r]^ 4 N (0, AVAR,,,,w.O , 

\i=l j=i J 

with asymptotic variance 

2fi /""^ 

AVARd,,,™„it, = -T G'it)ia^alfil + pi) dt . (A.6) 
A.2.1. Time-change in the asymptotic quadratic variation of time 

Proposition A.5. In the proof of a central limit theorem for the discretization error of the closest syn- 
chronous approximation T^^\ k — 0, . . . , iV, of our generalized multiscale estimator \\2\ on the As- 
sumptions^and^ we can additionally, without loss of further generality, assume that 



EfAT^-l) -o{N-^). (A.7) 

k=l ^ ^ 

Remark 2. From Assumptions [T] and [2j we can deduce directly that the sum above is at most of order 
A^^^. The stronger assertion, that the closest synchronous approximation defined by the times ri^^ fc — 



0, . . . ,N introduced in paragraph 3.2 is close to equidistant sampling in the sense that the sum above is of 
smaller asymptotic order than N^\is derived by the concept of a time-change in the asymptotic quadratic 
variation of time from Assumption]?] For the proof of ( |A.7[ ) we refer to Lemma 1 from ll30l where this 
concept has been presented for the univariate multiscale approach and it directly carries over to the syn- 
chronous multivariate case. 

On the Assumption|4] a transformation g can be defined that maps the refresh times T^^^ to values (/(T^^^ ), 
so that ( |A.7| i holds true for the transformed synchronous observation scheme. Thanks to the fact that the 
corresponding time-changed processes and A/gj^j fulfill AssumptionjTjagain and the transformed ob- 
servation scheme Assumpti on |2] we are able to prove a central limit theorem for the time-changed version 
of the discretization error if ( |A7| i did not hold. 

Since the resulting asymptotic variance will be invariant under the transformation g, the central limit the- 
orem will analogously hold true for the original sampling scheme. Hence, no further restriction has to be 
made when assuming (|A.7|l. 



A.2.2. Discretization error of the closest synchronous approximation 

Note that it suffices to prove the foregoing limit theorems for the zero-drift case. Since our limit 
theorems are stable, asymptotic mixed normality is assured to hold for the general setting on Assumption 
[l] Denote Lt = /J a^dW^ and Mt = /q a^dWy the continuous martingales that represent the efficient 
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processes under the equivalent martingale measure after a Girsanov transformation. The Novikov condition 
has been imposed in Assumption[T]to allow for this transformation. 

The asymptotic mixed normality result is implied as marginal distribution at t = T of a limiting time- 
changed Brownian motion which is proven to be the stable weak limit of the process corresponding to the 
discretization error ( pTj i with the theory of fTSl. 

We begin with the discretization error of the closest synchronous approximation of a one-scale subsampling 
estimator. 



Proposition A.6. On the same assumptions as in Proposition A.3 the continuous martingale 



N 



g,K^/>-'«*-^)(|('-0^*-') 



for t G [0, T], where A ■ — ■ ~ ■ t^-i the backward difference operator, converges stably in law 
as N (X),iN — > oo, in/N — > fo a limiting time-changed Brownian motion 



s ; 



where ^U-*- is independent ofT and 



Proof of Proposition A.6 



The subscript of the subsampling frequency is omitted in the following proof and C denotes a generic 
constant and Sn = supjg|;^ jy} (Ti — T,;-i). 

We apply a simplified martingale version of the stable central limit theorem 2-1 from fTSl. For other 
applications and expositions of the theory from l,18J we refer to 124] . I,12J and L7J. The above limit theorem 
is implied by the following three conditions: 



wj) ds 



(A.8a) 



[D,L], ^0, [S),M], 



0, 



(A. 8b) 



(A.8c) 



for all t e [0,r] and for all M-^ G Ml. and L-^ e £_L that denote the set of (J')j-adapted bounded 
martingales orthogonal to M and L, respectively. 
Calculating the quadratic variation of yields 



N 



'i/\k 

E 


(- 


^/ = l 


'lAk 




n 



^ iAk 



\i=i 



Tfc<t = l ^ ^ I \l=\ ^ ^ 



Op(l) 
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N 
iT 



iAk 



Tk<t 
iAk 



1^1 



E ^ Wt. E 1 - 7 (^^^^-0 + E ^ 



iAk 



E 1-7 (AiT._,) +2E^[^'*^kE 1 



' iAk 



Tk<t 
I 

i 



o,(l) 



Tk<t'' ^k-l \ 

Tfe<t -''=-1 \/=l ^ 

+ 2^ / P.-M'^ME 1 



Tk-i 



{-I Yds 



k-l-l 



Tk<f-'i'-i 



Tk-i 

2 .Tk 



Tk- 



TV 

Lemmi fTTl iT 



iAk 



E 



Tk<t 



Tk<t 

3 



3 Tfc — Ifc-i 

2 







In the first step cross terms of the inner sums have been neglected since they are centred and by Ito isometry 
it can be shown that their second moments are bounded from above by Ci^5%. We frequently use estimates 
(jjy^^ for sums of the type ^^^(AT'i)' > ^ > Ij by Holder's inequality with the supremum norm to obtain 
upper bounds. 

Subsequently squared increments of L and M and the increments of the product L ■ M m these inner sums 
are substituted by the increments of the quadratic (co-)variation processes. The induced error terms are 
centred by Ito isometry and involving Cauchy-Schwarz inequality it follows that CSiq is an upper bound 
for their second moments. 

The crucial non-standard approximation is that on each block (Tfc-i, . . . , Tk-iv{)) the increments of the 
form /p''"' f{t)dt with continuous functions / for Z = 1, . . . , fc A i are approximated by ATkf {Tk-i). 
This blockwise approximation is treated in Lemma fA.?] and makes use of the concept of a time-changed 
quadratic variation of times and particularly ( |A.7| i. Finally, ~ (V*))^ ^ 1/3 + ^(l) and the 

convergence in probability is ensured by Assumption |4] and the convergence of the Riemann sums to the 
integral. 



Lemma A.7. On the same assumptions as in Proposition \ A.3\ it holds true that the term 



N ^ 



Tk<t 



Tk 
Tk 



' iAk 



k-i 



iAk 



1=1 



and the analogous blockwise approximations of A[M]t^. and A[M]t^. through constant left-end points 
converge to zero in probability. 

Proof The approximation uses the concept of a time-change in the asymptotic quadratic variation of re- 
fresh times introduced in |30| which is expounded in A. 5 By virtue of that concept we may suppose with- 
out loss of generality that the sampling design of the closest synchronous approximation satisfies ( |A.7| i. 
The asymptotic orders of the three terms are deduced analogously and we restrict us to the proof of the 
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above given first term. An application of the mean value theorem yields 



N 
iT 



Tk<t 



N 



Tk-i 



/ iAk 



2 rn-i 



Tk<t 



with (k G [Tfc-i, Tfe], Q £ [Tq^i,Tq]. Since the volatility processes , are uniformly continuous on 
[0, T] by Assumption[T] 



E fi - 0' (4 - (-n-.r ^n-i < ^SN sup \{arr - i-rr 

l^l \ */ |t-s|<i(5jv 



Cla.s.(i<5Af) 



Tfc<t 



E k^a)'-K_J' K^J^AT,^ Ar,_, <*<5Ar sup |(arf-(ar)2| = Oa.,.(z5^) 

^ |t-s|<(5N 



hold almost surely (denoted a. s. ). 

With the Cauchy-Schwarz inequality and (|A.7|l, we obtain 



N 
if 



iAk 



Tk<t 



< ^ sup {afarYj2Y. 



iT 



N 



AT, 



N 



T 

N 



T 

N 



AT 



3+1 



1/2 



<^c\Y,{T^,+,),^-T,yY.{AT 



Furthermore, 

N 



iT 



Tk<t 



T 



Oa.s. (1) 



Efi-J) <f^(E(A^.rEfAT, 



1/2 



( = 1 



holds, where the right-hand side converges to zero almost surely due to ( |A.7| i and the Cauchy-Schwarz 
inequality. The preceding estimates imply the statement of the lemma. □ 

We proceed proving ( |A.8b| i that the quadratic covariations [S)^, L] ^ and [S)^, M] ^ converge to zero 
in probability for all t £ [0, T]. 



iAk 



Tfe<t' 



iAk 



^1=1 



has an expectation equal to zero for all t E [0,T] and the second moment is bounded above by iNSj^. 
The order follows from the evaluation of the second moment that is carried out analogously as for the 
calculation of [S)^] ^ before. For this reason [S^, L] converges to zero in probability on [0, T]. It can be 
directly deduced that [D^ , A/] ^ — Op(l) as well. If L-^ is a bounded (J^t) -martingale with [L, L^] = 0, 
the quadratic covariation 

^"]. - /I E A (e (i - !) ^^-^-) 

Tk<t \l=l ^ / / 
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converges to zero in probability on [0, T] what can be concluded following the same principles and also that 



[2)^, A/ J 
the proof of Proposition] A. 6 



Op(i) e t e [o,r] 



An application of Jacod's Theorem from ifTSi completes 

□ 



Proposition A.8. On the same assumptions as in Proposition A.4 the continuous martingale 



N 



N 



E 



El 



J2 Ult, + f a^dW, 

|:.K-X>=-')(|(' 



for t S [0,r] converges stably in law as N ^ oo,Mn — >■ oo, /N° 
limiting time-changed Brownian motion 



Q for every a > 2/3 to a 



where is independent of T and with 



26 
35 



Proof of Proposition A.8 



The discretization error of the generalized multiscale estimator calculated with the closest synchronous 
approximation under the equivalent martingale measure where the drift terms equal zero 



Mn ^opt N 



E ^ E (Lt, ~ iT,_.) (Mr, - Mt,_^) [X, Y], 



E' 



opt 



equals the weighted sum of Mm ~> oo discretization errors of the type considered in Proposition 
because X^t^i '^Tm^ ~ ^' ^o'^^' '^^at all approximation errors in the preceding proof of Proposition 
converge to zero in probability as long as — > oo, i/N" — >■ for every a > 2/3. 
We begin with the proof of a multivariate stable central limit theorem for a finite-dimensional vector: 



A.3 
A.6 



Lemma A.9. Consider the sequence of K- dimensional vectors = 



I where the entries 



X>'«,fc = 1, 



,K<oo are the continuous martingales 



Ai 



T,,_ 



with a sequence of integers i%,k — 1, . . . , K. On the Assumptions [7] |2] and |4] and if for every k € 
{1, . . . , K} there exists a constant qj. with i'^ /Mn — > qk, the following stable convergence holds true as 
N oo,Mn oo, Mn /N°' Ofor every a > 2/3; 



Mm * 



,dW, 



(A.9) 
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with a K- dimensional Brownian motion W independent of T and a predictable process Wg with 



T 

iwswl)„^„ = - niin(g,„,q„) ( 3 - 



niiii(gm,g«) 
max(9„,g„) 



(A. 10) 



with the convention that for = Qn — the ratio is one. 

For we obtain the following multivariate stable central limit theorem 



N 



N 



(A. 11) 



with = 2r/p (1 + pi){a^(jYfG'{t)dt and 

^ 1 . / .(^ min(q™,g„ 

Sm„ = - mm (q™, g„) 3 . 

6 V max(q™,g„ 

Proof. Define for fc e {1, . . . , i^} the continuous martingales 



N 



M. 



.'IN 



By virtue of Proposition A. 3 we already have that 

p, 2 



The limit of the quadratic covariations 
quadratic variation in the preceding proof: 



is derived using the same approximations as for the 
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t ~ Mn 



'min (i^, 



'AT 



''mill {i'^,i%,r) 

E^MtJ E (1 



T^<t 



1 = 1 



'min (i5(;,i^,r) 



'TV 



1 - 



"■AT 



AL 



) {AMt 



Tr-l 



T^<t 



'AT 



I 



ALt _,AMt 



Opil) 



Tr.<t 



ATr 

' min ii'S,i%,r) 



E 



1 - 



'at 



1 - 




Op(l) 



min(g„,,gfc) 
max {qm,qk] 



ds 



since - - (V^-^)) = (l/2)m - {m^/6M) - 1/8 + 1/(12A/) for m,M e Z. The 

multi-dimensional version of Jacod's stable central limit Theorem 2-1 from |181 enables us to prove the 
result of stable weak convergence of the vector provided we can verify the conditions 

[D^, IL] , ^ , [D^, M] ^ ^ 0, yte [0, T] , 

where IL denotes the vector with entries V — L,j — 1, . . . ,K and M with = M, j = 1, . . . ,K, 
respectively, and 

[D^,L^]^^0, [D^,1M^]^^0, Vie [o,r] 
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where L-*- and M-'- are bounded (J"t) -adapted martingales orthogonal to IL and M, respectively. That is 
because the reference continuous martingales for all entries of the vector are L and M. The componen- 
twise proof of the conditions above is yet analogous as for the univariate case in the preceding proof. We 
conclude that the asymptotic distribution of the vector is described by a limiting time-changed Brownian 
motion on [0, T], and the marginal distribution at time T by a mixed Gaussian limit, where the normal dis- 
tribution is defined as well as for all componentwise marginals on an orthogonal extension of the original 
underlying probability space. □ 

From the preceding multivariate limit theorem the Cramer- Wold device allows to conclude the weak 
convergence of all one-dimensional linear combinations of the transformed discretization errors of a finite 
collection of one-scale subsampling estimators. For an asymptotically N (0, S) -distributed random vector 
the sum of all components is asymptotically normally distributed with variance J2i ji^ij) by the Cramer- 
Wold device and the normality of any linear sum of components of a multivariate normal distribution (see 
e.g.pp.516-517in[25|). 

The asymptotic variance in Proposition A.4 is deduced from the multivariate limit and 



E 

k,l 



Mn k 

fe=i 1=1 



6Mn 



opt opt 



+ 0(1)^^+0(1) 



with the weights ( |A.l| l inserted. 

For the completion of the proof of Propositions A. 8 and hence A.4 it remains to extend the result for 
asymptotically infinitely many addends. This part of the proof can be adopted from [^30l where a stable 
central limit theorem for a multiscale estimator for the integrated volatility in the univariate setting is 



proved. 



□ 



A.2.3. Discretization error due to the lack of synchronicity 
Proposition A.IO. On the Assumptions^and^ it holds true that 
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E (^T.-Li, 



= o. 



for the error associated with interpolation errors for a one-scale subsampling estimator 

Proof. is the endpoint of a J-j^n = -^t^(n) -measurable discrete martingale with conditional expectation 
zero, since the addends incorporate products of Brownian increments over disjoint time intervals. The 
conditional variance yields 
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The variance of the term is of order {iN)~^ which can be proved by taking the expectation of the above 
given conditional variance and an upper bound of the second moment. The asymptotic orders of the ad- 
dends follow from taking the expectations using Ito isometry and analyzing the differences of the addends 
minus their expectations, that converge to zero at a faster rate. That part is similar to the proofs above and 
we forgo a more detailed computation here. □ 

Denote A^'^ the error due to non-synchronicity and interpolations for a fixed subsampling frequency 
i = 1, . • . , Mn in the following. The error due to asynchronicity of the generalized multiscale estimator 
( [T2| i equals the weighted sum '^fL" a°^lj^A^'''. It has expectation zero and the variance is of order 

/Mm \ 
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Thus, the error due to interpolations is of smaller asymptotic order than the discretization error of the 
closest synchronous approximation and asymptotically negligible. 

A. 3. Asymptotics of the cross term 

For a one-scale subsampling estimator cross terms are asymptotically negligible and hence the stable 
central limit theorem in Theorem 4.1 is implied by Theorem A. 3 For the proof of the stable central limit 
theorem in Theorem[3]for the multiscale approach, we cope with the asymptotics of the cross terms in this 
subsection. 

Proposition A.ll. On the Assumptio ns [7] [2] |5]flnt/|5] the cross terms of the generalized multiscale esti- 
mator \\2\ with noise-optimal weights ( |A.1[ ) weakly converge to a mixed normal limit as Afjv — oo, iV — > 
oo, MnSn 0: 

Mm opt N 
^ ^i.M. 



^ N (0, AVAR,,o,,) , (A. 12) 



with asymptotic variance 
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The convergence holds conditionally given the paths of the efficient processes. 
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Proof. This proof affiliates to the discussion in Section]?] where degrees of regularity of non-synchronous 
sampling schemes have been defined in Definition [2] that are assumed to converge to continuously differ- 
entiable functions. 

On the Assumption [3] of independent observation noise of X and Y, the two different cross terms are 
uncorrected and we prove a central limit theorem for the first one: 
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ii + i{-mafydt\ . 



The parallel result for the other term can be proved analogously. 

For the purpose of a shorter notation we have left out superscripts of the observation times, and write 
ai, i = 1, . . . ,Mn for the weights although we are interested in the specific weights ( |A.l| i. Denote 

Sn = supig{i_ jv} and = min (r^ e T^\Tk £ g^+^),gj^+ = min {tk € T^\tk £ W+^) and 
C a generic constant as before. From 
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for boundary terms, we conclude that 
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Here, we aggregate the observation times 7j, Aj, j = 2, . . . , iV — 2 in disjoint sets conforming to the four 
cases discussed in Section |4] Denote thereto 

3^1 = {j e {2, . . . , iV2}|7, ^ li-i , Ij < 9j} , 

3^2 = {j e {2, . . . , N2}\-fj > 9j , 1, > gj^+} , 

3^3 = {j e{2,..., iV2}|7j > 9j , 1] < 9j.+ , > , 

3^4 = {j e {2, . . . , iV2}|7j > 9j , 7j < , 7j,+ < , 

and M'^{j) = min (7, N — j, Mm)- The increments of X that are multiplied with each observation error 
differ according to the set 3^^, 1 < fc < 4 to which 7^ belongs. We use the notation 

Ci°j = {Xtj - Xtj^,) , Ci!j = (Xtj+.+i - Xt^^i) ■ 

The resulting aggregated leading term above of the cross term is the endpoint of a discrete martingale 
with respect to the filtration J^j^n := a[e^jTk < 7j+i , X,Y). Since if j G 3^4 => < Jj+i, the 
martingale property with respect to the filtration J^j^n is assured by Assumption[3] 

An application of the non-stable version of the central limit theorem for martingale triangular arrays from 
lfr4J will proof the asymptotic normality of the cross term conditionally on the paths of the efficient pro- 
cesses. The conditional Lindeberg condition can be verified (using Chebyshev's inequality or directly 
verifying the conditional Lyapunov condition) in the same way as before and we omit it here. The sum of 
conditional variances yields 
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Since for the shifted increments 
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holds, where the order is for time instants of average length N^^, the variances of the sums over all 
j S 3^1 and j € 3^3 are asymptotically equal. The variance of both uncorrected sums over maxima jj 
and minima 7^,+ distributed according to the fourth case is also asymptotically equal to the variances of 
those two addends. Only the asymptotic variance of the sum over all j £ 3^2 is bigger. For this reason the 
total asymptotic variance hinges on the asymptotic degree of regularity of the non-synchronous sampling 
scheme (T^ ,T^) defined in Definitional 
In the calculation of the asymptotic variance we have used that 

/=j-(ivfc) + l l=j + {i/\k) + l 

where the second remainder addend has an expectation equal to zero, and analogous formulae for (fj, for 
all 1 < i < Mn , 1 < fc < Afjv , i < j < N - {iy k). 

Furthermore, an application of the mean value theorem, Ito isometry and approximations in the same spirit 
as in the calculation of the asymptotic variance in the proof of the central limit theorem for the discretization 
errors of the estimators, lead to the Riemann sum in the calculation of the asymptotic variance above. The 
cross terms in (C' j)'^, ^ = 1, 2 are asymptotically negligible. Since in 3^4 repeating maxima ji = 7^+1 are 
considered only once, it holds true that |3^i| + 13^31 + 13^41+213^21 ^A^ — 3±1 (the last addend can appear 
due to boundary term effects). In the last step we have used that 

opt opt 

Y A fc) = 6/5 + 0(1) 

i,fce{l,...,Miv} 

when inserting the weights ( |A.l| i. 

From the analysis for the asymptotic discretization error of a one-scale subsampling estimator, we know 
that 

j=i/\k \ / 

holds true. Similarly, it can be deduced that 
1 ^ 



on As sumption |5] 
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□ 



B. Proof of Proposition |5.1 1 

Let Rf, RJ, Rf, Sf'^ and Sf'^ denote the number of times T^"\ < k < N, ^"^ and rj"^ in 
the bins [Gf ,Gf+i), [(/y)f , (/y)f+i), [(/x)f , (/x)f+i),0 < j < Kn - 1,. Define the generalized 
multiscale estimator in the fashion of ( [T2| ) 

MnU) opt Rf 
2—1 r— i 

for the increase of the quadratic covariation A [Jf, Y]qn and the univariate multiscale estimators 

M„(j) opt B'; M,Uj) opt Rf 



i=i 
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and A [XJ^^^^n , A [Y] analogously, where all binwise multiscale frequencies are of order \/NKn 

with possibly differing constants. Essential when considering the multiscale estimators on bins is that on 
Assumption [4] the distances between sampling times are of order N^^ ^ n^^ ~ m~^, whereas the 
numbers of observations i?' , S*' , 1 < j < Kjq in the specific bin are at most of order NKz}. Following 



the analysis for the four uncorrelated parts of the estimation error in sections Appendix A orders of the 
discretization variances 
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ik ^ iV2 
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Mn{j)/{nKN), Mm{j)/{mK]y),Mn{j) and Mn{j) , respectively. Cross terms are of order 
Rj' /{NMN{j)) ^ {MN(j)Ki\})^^ and analogous orders for the univariate estimators. The errors due to 
noise instead depend only on the number of observations in the considered interval. Therefore, the addends 
are of orders Rj' /Mff{j) ^ KnM^ (j) -^-^n^U) analogous for the univariate estimators. 

Choosing all multiscale frequencies M. (j) ^ iV^/^if^^ for every j, so that AfAr(-)iVV2 00, the error due 
to end-effects in the noise part and the discretization error dominate asymptotically the two other addends 
and are of order N~'/^K^''\ This holds as long as KnN-'/'' 0, such that MN{j){N/KNy^ as 
N ^ 00. 

The estimators ( |22a[ l-( |22d] l are consistent as — > 00 with K^N^'/^ — > as — > 00, since 
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and similar conclusions for the other three estimators. We have used that AGj^ ^ N ^ and apply 
the mean value theorem. is some value G^_i < G^ < G^ . Finally, elementary inequalities as 



EKjv 
.7 = 1 



< K)^ 



G" (T) ^ 
Kn - "^P|t-s|<Asupj. G^- 



and 



p.afaj G"(r) = Oa...(l)are 



involved in the approximations of the type above. Considering \22c\ and (|22d|, note that bin-widths chosen 
accordingly to ly are asymptotically of order K^^ in any interval of [0, T] on that the corresponding part 
of the integral / /y (t)((Tf^)2 dt is strictly positive. 

Denote R^ , k ~ 1 , . . . , 4, the orders of the approximation errors of the four above given integrals and their 
Riemann sums evaluated on the partition given K^ bins. The variance of the estimators 77^ and r/y for the 



noise variances are known to be E 
[3]from llD. From 



andE 



N ^ and hence O [N ^) on Assumption 



we derive that 
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and the same result for the one-scale estimator 
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